A Cost-Aware and Workload-Based Index Advisor for Columnar In-Memory Databases

نویسندگان

  • Martin Boissier
  • Timo Djürken
  • Rainer Schlosser
  • Martin Faust
چکیده

Optimal index configurations for in-memory databases differ significantly from configurations for their traditional disk-based counterparts. Operations such as full column scans that have previously been prohibitively expensive in disk-based and row-oriented databases are now computationally feasible with columnar main memory-resident data structures and even outperform index-based accesses in many cases. Furthermore, index selection criteria are different for in-memory databases since maintenance costs are often lower while memory footprint considerations have become increasingly important. In this paper, we introduce a workload-based and cost-aware index advisor tailored for columnar in-memory databases in mixed workload environments. We apply a memory traffic-driven model to estimate the efficiency of each index and to give a system-wide overview of the indices that are cost-ineffective with respect to their size and performance improvement. We also present our Index Advisor Cockpit applied to a real-world live production enterprise system of a Global 2000 company.

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

A Storage Advisor for Hybrid-Store Databases

With the SAP HANA database, SAP offers a high-performance in-memory hybrid-store database. Hybrid-store databases—that is, databases supporting rowand column-oriented data management—are getting more and more prominent. While the columnar management offers high-performance capabilities for analyzing large quantities of data, the row-oriented store can handle transactional point queries as well ...

متن کامل

An Adaptive Aggregate Maintenance Approach for Mixed Workloads in Columnar In-Memory Databases

The mixed database workloads generated by enterprise applications can be categorized into shortrunning transactional as well as long-running analytical queries with resource-intensive data aggregations. The introduction of materialized views can accelerate the execution of aggregate queries significantly. However, the overhead of materialized view maintenance has to be taken into account and va...

متن کامل

Development and Validation of a Pilot Activity Load Index (PALI) based on NASA-TLX template

Abstract Introduction: Workload can be defined as the hypothetical construct that represents the cost incurred by a human operator to achieve a particular level of performance. Each job has specific needs and demands. The better measurement tool assessing that estimate the workload, it’s need to identify the requirements of a task, the circumstances under which it is performed, and the skills,...

متن کامل

WiSeDB: A Learning-based Workload Management Advisor for Cloud Databases

Workload management for cloud databases deals with the tasks of resource provisioning, query placement, and query scheduling in a manner that meets the application’s performance goals while minimizing the cost of using cloud resources. Existing solutions have approached these three challenges in isolation while aiming to optimize a single performance metric. In this paper, we introduce WiSeDB, ...

متن کامل

Using Object-Awareness to Optimize Join Processing in the SAP HANA Aggregate Cache

The introduction of columnar in-memory databases, along with hardware evolution, has made the execution of transactional and analytical workloads on a single system both feasible and viable. Yet, doing analytics directly on the transactional data introduces an increasing amount of resourceintensive aggregate queries which can slow down the overall system performance in a multi-user environment....

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2016